Improving Automatic Text Classification by Integrated Feature Analysis
Similar resources
Improving Automatic Text Classification by Integrated Feature Analysis
SUMMARY: Feature transformation in automatic text classification (ATC) can lead to better classification performance. Furthermore, dimensionality reduction is important in ATC. Hence, feature transformation and dimensionality reduction are performed to obtain lower computational costs with improved classification performance. However, feature transformation and dimension reduction techniques hav...
Automatic Feature Induction for Text Classification
The Problem: All classifiers require a set of features that can be used to distinguish between different examples. In some cases, such as determining whether a chess position is a winning position, the features are clear (the positions of the chess pieces). In other cases, such as text, they are less clear. A document is simply a string of characters. Standard practice dictates that documents s...
Improving Text Classification by Web Corpora
A major difficulty of supervised approaches for text classification is that they require a great number of training instances in order to construct an accurate classifier. This paper proposes a semi-supervised method that is specially suited to work with very few training examples. It considers the automatic extraction of unlabeled examples from the Web as well as an iterative integration of un...
Classification of Text, Automatic
Automatic text classification (ATC) is a discipline at the crossroads of information retrieval (IR), machine learning (ML), and computational linguistics (CL), and consists in the realization of text classifiers, i.e. software systems capable of assigning texts to one or more categories, or classes, from a predefined set. Applications range from the automated indexing of scientific articles, to...
Multi-domain text-to-speech synthesis by automatic text classification
This paper describes a multi-domain text-to-speech (MD-TTS) synthesis strategy for generating speech among different domains and so increasing the flexibility of high quality TTS systems. To that effect, the MD-TTS introduces a flexible TTS architecture that includes an automatic domain classification module, which allows MD-TTS systems to be implemented by different synthesis strategies and sp...
Journal
Journal title: IEICE Transactions on Information and Systems
Year: 2008
ISSN: 0916-8532,1745-1361
DOI: 10.1093/ietisy/e91-d.4.1101